Information filtering via preferential diffusion 
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Recommender systems have shown great potential to address information overload problem, 
namely to help users in finding interesting and relevant objects within a huge information space. 
Some physical dynamics, including heat conduction process and mass or energy diffusion on net- 
works, have recently found applications in personalized recommendation. Most of the previous 
studies focus overwhelmingly on recommendation accuracy as the only important factor, while over- 
look the significance of diversity and novelty which indeed provide the vitality of the system. In 
this paper, we propose a recommendation algorithm based on the preferential diffusion process on 
user-object bipartite network. Numerical analyses on two benchmark datasets, MovieLens and Net- 
fiix, indicate that our method outperforms the state-of-the-art methods. Specifically, it can not only 
provide more accurate recommendations, but also generate more diverse and novel recommendations 
by accurately recommending unpopular objects. 

PACS numbers: 89. 75. He, 89.20.Ff, 05.70.Ln 
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I. INTRODUCTION. 



The development of information technology brings 
great impact on human society. Therein, the most sig- 
nificant aspect is the revolutionary change in the ways 
of life. Twenty years ago, if one wants to buy some- 
thing, he/she has to personally go to a physical shop and 
purchase, and then bring the things back home. It is im- 
possible for him/her to compare the commodities in dif- 
ferent markets located at different places in a short time. 
Now with the growth of the Internet and World Wide 
Web, we can almost manage our life at home. When 
we want to buy a book, we don't need to go to book- 
stores any more to find it on bookshelves one by one, 
instead what we need to do is typing the title of this 
book on the website of Amazon — an online retailer of 
books. If we want to buy a cell phone, we can compare 
the prices on different web-shops at the same time with- 
out any transportation fee. Formerly, we usually go to 
a bar after working and enjoy making friends there, now 
we prefer online dating that allows us to reach people 
over the world. In a word, the Internet benefits us by 
providing a much more convenient way to get what we 
want. However, as a coin has two sides, Internet also 
brings us confusion — we face information overload. As 
we know, not all the online information are good or true 
or favorable by surfers. Therefore, we need to distin- 
guish and select between valuable information and junks. 
In this sense, to get what we want or the most satisfied 
things becomes more and more difficult, since we face 
much more choices than before. A useful information fil- 
tering technology is search engine by which users 
can find the relevant information with properly chosen 
keywords or tags. However, search engines have two dis- 
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advantages which limit their applications. Firstly, they 
lack the consideration of personalization and thus return 
the same results to people no matter what their prefer- 
ences are. Secondly, search engines require the users to 
know exactly what they want and extract some proper 
keywords to do the searching. However, sometimes the 
tastes or preferences can not be easily expressed by key- 
words or the users don't even know what they want at 
all. In these cases, the search engines are of no avail. 

To address these problems, recommender systems rise 
in response to the proper time and conditions, which 
form or work from a specific type of information fil- 
tering technique that attempts to recommend informa- 
tion items, such as movies, TV programs, videos, music, 
books, news, images and web pages, that are likely to 
be of interest to the users. The recommender systems 
don't require specified keywords provided by users, in- 
stead they use the users' historical activities and possi- 
ble personal profiles to uncover their preferences or po- 
tential interests. Many recommendation algorithms have 
been developed, including collaborative filtering (CF) 0- 

, content-based analysis Q , spectral analysis [3, ll] and 
iterative self-consistent refinement [1, [13] ■ What most 
have in common is that they are based on similarity, ei- 
ther of users or objects or both. Such approach is under 
high risk of providing poor coverage of the space of rele- 
vant items. As a result, with recommendations based on 
similarity rather than difference, more and more users 
will be exposed to a narrowing band of popular objects. 
Although it seems more accurate to recommend popu- 
lar objects than niche ones, being accurate is not enough 
It was pointed out that the recommendations that 
are most accurate are sometimes not the recommenda- 
tions that are useful to users. For example, would you 
use such a system that recommends the movies you in- 
deed like but have seen before or just watched in the 
cinema? Diversity and novelty are also important cri- 
teria of algorithmic performance. A possible way to in- 



2 



crease the recommendation diversity is utiUzing the tags 
of objects [H-[l3l- Another promising way is consider- 
ing the dissimilar users' contribution. It was shown that 
under the framework of collaborative filtering the dissim- 
ilar users can contribute to both accuracy and diversity 
of personalized recommendation [l5l |. However, these im- 
provements are very limited. 

Recently, some physical dynamics, including mass dif- 
fusion [l6|,[i7| and heat conduction process jl8(| have been 
applied to design recommender systems. Zhou et al. pro- 
posed a network-based inference method (NBI) by con- 
sidering the three-step mass diffusion starting from the 
target user on a user-object bipartite network This 
method has been demonstrated to be more accurate than 
the classical CF algorithm while with lower computa- 
tional complexity. However, it has difficulty in generating 
diverse recommendations. The heat conduction process 
has been found its effectiveness in providing a diverse 
recommendation at the cost of accuracy. This diversity- 
accuracy dilemma can be effectively solved by coupling 
these two processes [l^ . It was shown that not only does 
the hybrid algorithm outperform other methods but that, 
without relying on any semantic or context-specific infor- 
mation, it can be tuned to obtain significant gains in both 
accuracy and diversity of recommendations. 

With the same motivation, we proposed an algorithm 
based on a preferential mass diffusion process on user- 
object bipartite networks, without consideration of heat 
conduction which may stealthily hurt accuracy. Numer- 
ical analyses on two benchmark datasets show that our 
method can give higher accurate as well as more diverse 
and novel recommendations than the hybrid algorithm, 
because of its high accurate recommendations on low- 
degree objects. 

II. PREFERENTIAL DIFFUSION METHOD 

A recommender system can be represented by a bi- 
partite network G{U,0, E), where U = {wi, M2, Um}, 
O = {oi, 02, o„} and E = {ei, 62, e^} are the sets of 
users, objects and links respectively @. Denote by Amxn 
the adjacency matrix, where the element equals 1 if 
Ui has collected object Oq, and otherwise. 

The essential task of a recommender system is to gener- 
ate a ranking list of the target user's uncollected objects. 
The original diffusion-based recommendation algorithm, 
called network-based inference (NBI), was proposed in 
Ref. fl^. It was referred as ProbS algorithm in Ref. 
p^ . NBI works by assigning objects an initial level of 
resource denoted by the vector f (where /„ is the re- 
source possessed by object Oq), and then redistributing 
it via the transformation f = Wf, where 

/5 1=1 ' 

is the resource transfer matrix, and /cq^ = Y^7=i '^iP ^'^'^ 



= '^h denote the degrees of object and user 

ui respectively. For a target user u^, we assign one unit 
resource on those objects already collected by Ui for sim- 
plicity, thus the initial resource vector f can be written 
as 

fa — ^ia ■ (2) 

That is to say, if object Oa is collected by user Ui then 
it has one unit resource, otherwise 0. With this initial 
resource vector, the result of NBI is equivalent to a three- 
step random walk process starting from the target user on 
a bipartite network [20]. Note that, if the initial resource 
vector is normalized by the target user's degree, namely 
fa = o,ia/kui, the rcsults are exactly the same. In fact, 
the process of NBI is equivalent to resource-allocation 
which is also a random-work-based process. Given the 
initial resource distribution as shown in Eq. [5J the re- 
source of each object will be redistributed according to 
EqU] where Wap indicates how many proportion of re- 
source that object a gives to object /3. Then after the 
resource-allocation process, we obtain the final resource 
possessed by each object by summing up all the resources 
distributed from other objects. The recommendation list 
for user Ui is generated by ranking all his/her uncollected 
objects in decreasing order according to their final re- 
source. 

A heterogenous initial resource distribution NBI algo- 
rithm (abbreviate as Heter-NBI) was proposed by Zhou 
et al. 21], where the initial resource of object Oq is 
proportional to fc^^ .Thus the initial resource vector of 
Heter-NBI can be written as fa — aia ■ k^a where 9 is 
a negative parameter. It was shown that Heter-NBI can 
give more accurate recommendations than the standard 
NBI. There are other two advanced recommendation al- 
gorithms. One is an improved algorithm by eliminating 
redundant correlations (called RE- NBI for short) }22| . 
which is defined as 

f = {W + r]W^){, (3) 

where the elements of matrix W are defined by Eq. [U 
the initial resource vector f is defined by Eq. [2] and i] is a 
free parameter. This method has been approved to out- 
perform some classical methods, such as the global rank- 
ing method, the cosine-similarity-based collaborative fil- 
tering [23], NBI and Heter-NBI for both accuracy and 
diversity by considering the high-order correlations be- 
tween objects. The other method, referred as Hybrid-PH 
in this paper, is proposed in Ref. ||19] , which is a hybrid 
algorithm combining the HeatS (i.e., heat conduction) 
and ProbS (i.e., mass diffusion) by incorporating the hy- 
bridization parameter A into the transition matrix nor- 
malization: 

where A = gives the pure HeatS algorithm, and A = 1 
gives the ProbS (i.e., NBI). 
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Based on mass diffusion method and motivated by en- 
hancing the algorithm's abihty to find mipopular and 
niche objects, we propose a preferential diffusion (PD) 
method for recommendation in user-object bipartite net- 
works. The basic idea is that at the last step (i.e., diffus- 
ing from users to objects), the amount of resource that 
an object Oq. received is proportional to , where e < 
is a free parameter. In this case, the resource transfer 
matrix reads 



1 



h ■ h~' 



M 



(5) 



1=1 



where M = YJ^=i O'lrK,^ = kui ■ MairK^)- ^{airk^J in- 
dicates the mean value of k^^ over all the objects having 
been collected by user m;. Here we consider the sim- 
plest initial resource vector defined by Eq. [21 Clearly, 
when £ = it will degenerate to NBI. Notice that, 
if we consider the NBI algorithm as a three step dif- 
fusion starting from target user to final objects (i.e., 
user^object— Tenser— )-object), then the Heter-NBI algo- 
rithm is essentially equivalent to the algorithm with pref- 
erential diffusion only at the first step, while PD consid- 
ers the third step. However, their motivations are es- 
sentially different. Heter-NBI emphasizes that users who 
co-collected unpopular objects are more similar to each 
other than those co-collected popular objects. And thus 
the target user distributes more resource to his/her more 
similar users by giving more resource to their co-collected 
unpopular objects. However, after the third step diffu- 
sion the resource still can be centralized on some pop- 
ular objects. The PD algorithm directly punishes the 
popular object by assigning more resource to the low- 
degree objects at the last step. Experimental results 
show that considering the preferential diffusion at the 
last step is much more effective than at the first step. 
In order to show that preferential diffusion at first step 
(i.e., Heter-NBI) and at last step (i.e., PD) play different 
roles in recommendation, we further investigate the PD 
algorithm with heterogenous initial resource distribution, 
called Heter-PD, which is controlled by two tunable pa- 
rameters. Comparing with all the mentioned algorithms 
in this paper, Heter-PD performs the best over all five 
evaluation metrics considered in this paper (see section 
3 for the definitions of evaluation metrics). Comparing 
Eq. m with Eq. [SJ we can find that if we assume that for 
user ui who has collected object O/j, the approximation 
E,{airkg^) ~ kg^ holds, namely the mean value of k^^ over 
all the objects having collected by user m always equals 
kg^ , PD is equivalent to the hybrid algorithm by setting 
e = A — 1. However, this assumption is too strong to be 
satisfied in reality. 

Note that, we didn't consider the preferential diffusion 
at the second step from the object side to the user side 
(PD-II for short). The main reason is that this method 
may lead to some illogical results. Considering the case 
that the target user Ui selected a very popular object 
Oa which is also selected by another user Uj who is as- 
sumed to be a new user of the system and only selected 



Oa. Via the PD-II method, Uj will obtain more resource 
from Oa than other users who also selected Oa , leading to 
the conclusion that Uj is more similar to Ui. Apparently 
this result is wrong, since a new user usually selects pop- 
ular objects, which is a common behavior in such kind 
of systems [24], and it is unreasonable to say this new 
user is more similar to the target user just according to 
such a common behavior. In addition, we have tested the 
performance of PD-II method. Comparing with standard 
NBI method, the improvement of accuracy (measured by 
ranking score) is very slight around 1% on MovieLens 
data and 0.6% on Netflix data. Therefore, we didn't 
consider this method for further analysis. 



III. DATA AND METRICS 

To test the algorithmic performance, we 
use two benchmark datasets. The MovieLens 
(http://www.grouplens.org/) data consists of 1682 
movies (objects) and 943 users who can vote for movies 
with five level ratings from 1 (i.e., worst) to 5 (i.e., 
best). The original data contains 10^ ratings. Here 
we only consider the ratings higher than 2. After 
coarse gaining the data contains 82520 user-object pairs. 
The Netflix data (http://www.netflixprize.eom/l is a 
random sampling of the whole records of user activities 
in Netflix.com. It consists of 10000 users, 6000 movies 
and 824802 links. Similar to the MovieLens data, only 
the links with ratings no less than 3 are considered. 
After data filtering, there are 701947 links left. To test 
the algorithmic performance, the data (i.e., known links) 
is randomly divided into two parts: The training set 
contains 90% of the data, and the remaining 10% 
of data constitutes the probe set . Notice that, any 
isolate object can not be recommended to users through 
the algorithms considered in this paper. Therefore to 
ensure the connectivity of the whole network, each time 
before moving a link to the probe set, we first check if 
this removal will result in isolate user or object, and 
we do not allow the removal that leads to unconnected 
nodes. 

Accuracy is the most important aspect to evaluate the 
recommendation algorithmic performance. A good al- 
gorithm is expected to give accurate recommendations, 
namely higher ability to find what the users like. Here 
we use Ranking Score J^j to measure the ability of a 
recommendation algorithm to produce a good ordering 
of objects that matches the user's preference. For a tar- 
get user, the recommender system will return a ranking 
list of all his uncollected object to him. For each hid- 
den user-object relation (i.e., the link in probe set), we 
measure the rank of this object in the recommendation 
list of this user. For example, if there are 1000 uncol- 
lected objects for user m, and object Oa is at 10th place, 
we say the position of this object is 10/1000, denoted by 
RSia = 0.01. A good algorithm is expected to give high 
ranks to the hidden objects, and thus leading to small 
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RS. Averaging over all the hidden user-object relations, 
we obtain the mean value of ranking score RS that can 
be used to evaluate the algorithm's accuracy, namely 



RS = 



1 



EP 



■ E 



RSi, 



(6) 



where ia denotes the probe link connecting Ui and Oq. 
Clearly, the smaller the ranking score, the higher the 
algorithm's accuracy, and vice versa. Since real users 
usually consider only the top part of the recommenda- 
tion list, a more practical measure may be to consider 
the number of user's hidden links contained in the top- 
L places. Therefore, we adopt another accuracy metric 
called Precision. For a target user Ui, the precision of 
recommendation, Pi(L), is defined as 



(7) 



where di{L) indicates the number of relevant objects 
(namely the objects collected by Ui in the probe set) in 
the top-L places of recommendation list. Averaging the 
individual precisions over all users with at least one hid- 
den link, we obtain the mean precision P{L) of the whole 
system. 

Besides accuracy, diversity is taken into account as an- 
other important aspect to evaluate the recommendation 
algorithm. There are two kinds of diversity. One is called 
Inter-Diversity which considers the uniqueness of differ- 
ent users' recommendation lists. Given two users Ui and 
Uj , the difference between their recommendation lists can 
be measured by the Hamming distance [2]| . 



(8) 



objects [26|. Here we adopt the widely used cosine sim- 
ilarity to measure object similarity. For two objects Oq, 
and their similarity is defined as 
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(10) 



Averaging li (L) over all users we obtain the mean intra- 
similarity I{L) for the system. A good recommendation 
algorithm is expected to give fruitful recommendations 
and has the ability to guide or help the users to exploit 
their potential interest fields, and thus leads to a lower 
intra-similarity (i.e., higher intra-diversity). 

High accurate recommendations might not be satisfied 
by the users. For example, recommending popular film 
Avatar to a user on MovieLens website is not always the 
best, because he/she might have already seen this film at 
the cinema. A diverse recommender system is expected 
to find the niche or unpopular objects that can not be 
easily known by other ways yet match users' preferences. 
The metric Popularity quantifies the capacity of an al- 
gorithm to generate novel and unexpected results, that 
is to say, to recommend less popular items unlikely to 
be already known about. The simplest way to calculate 
popularity is to use the average collected times over all 
the recommended items, as: 



ML) ^2 E 



(11) 



where is the recommendation list for user Ui. Clearly, 
lower popularity indicates higher novelty and surprisal. 
Averaging Ni{L) over all users we obtain the mean pop- 
ularity N{L) for the system. 



where Cij{L) is the number of common objects in the 
top-L places of both lists. Clearly, if ui and Uj have 
the same list, Hij{L) — 0, while if their lists are com- 
pletely different, Hij{L) = 1. Averaging Hij{L) over all 
pairs of users we obtain the mean distance H{L), for 
which greater or lesser values mean, respectively, greater 
or lesser personalization of users' recommendation lists. 
A good algorithm should not only give diverse recom- 
mendations among users (i.e., high inter-diversity), but 
also provide diverse recommendations for a single user 
(i.e., high intra-diversity) [13, H^. The latter can be 
measured by Intra- Similarity. For a target user u^, his 
recommended objects are {oi, 02, • • • , ol}, then the intra- 
similarity of Mi's recommendation list is defined as (22j : 



(9) 



where is the similarity between objects Oq, and op in 
Mi's recommendation list. There are many similarity in- 
dices that can be used to quantify the similarity between 



IV. RESULTS 

Summaries of the results for all algorithms and met- 
rics on MovieLens and Netflix datasets are shown re- 
spectively in Table U and Table HT] The so-called optimal 
parameters are subject to the lowest ranking score. And 
the other four metrics, namely precision, intra-similarity, 
hamming distance and popularity, are obtained at the 
optimal parameters. Clearly, PD outperforms Heter-NBI 
over all the five evaluation metrics. Among all four pre- 
vious algorithms, Re-NBI gives the highest accuracy by 
considering the high-order correlations between objects, 
while Hybrid-PH has the best performance on diversity 
and novelty. Comparing with these two outstanding al- 
gorithms, PD can reach or closely near the best accuracy 
without considering high-order correlation between ob- 
jects, and provide much more diverse results. By consid- 
ering the heterogenous initial resource distribution the 
algorithmic performance can be further improved. For 
example, in MovieLens Heter-PD decreases the rank- 
ing score to 0.081 with the parameters 9 = —0.25 and 
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TABLE I: Algorithmic performance for MovieLens data. The precision, intra-similarity, hamming distance and popularity are 
corresponding to L = 50. Heter-NBI is an abbreviation of NBI with heterogenous initial resource distribution, proposed in Ref. 
[2H . RE-NBI is an abbreviation of redundant-eliminated NBI, proposed in Ref. [22|. Hybrid-PH refers to the hybrid method 
which combines ProbS and HeatS algorithms. PD is an abbreviation of preferential diffusion method presented in this paper. 
Heter-PD is an abbreviation of PD with heterogenous initial resource distribution. The parameters (ranging in the interval 
[0,1] for Hybrid-PH and [-1,0] for the rest five algorithms with step 0.05) for the parameter-dependent algorithms are set as 
the ones corresponding to the lowest ranking scores (for Heter-NBI, Sopt ~ —0.80; for RE-NBI, rjo^t = —0.75; for Hybrid-PH, 
Aopt = 0.20; for PD, Sopt ~ —0.85; for Heter-PD, (S,£)opt = (-0.25,-0.8)). Each number is obtained by averaging over five 
runs with independently random division of training set and probe set. The entries corresponding to the best performance over 
all methods (except Heter-PD) are emphasized in black. 



Algorithms Ranking Score Precision Intra-Similarity Hamming Distance Popularity 



NBI 


0.106 


0.071 


0.355 


0.617 


233 


Heter-NBI 


0.101 


0.074 


0.340 


0.680 


220 


RE-NBI 


0.082 


0.085 


0.326 


0.788 


189 


Hybrid-PH 


0.085 


0.083 


0.296 


0.821 


167 


PD 


0.082 


0.084 


0.282 


0.847 


155 


Heter-PD 


0.081 


0.086 


0.278 


0.858 


153 



TABLE II: Algorithmic performance for Netflix data. The precision, intra-similarity, hamming distance and popularity are 
corresponding to L = 50. The parameters (ranging in the interval [0,1] for Hybrid-PH and [-1,0] for the rest five algorithms 
with step 0.05) for the parameter-dependent algorithms are set as the ones corresponding to the lowest ranking scores (for 
Heter-NBI, 9oft = -0.70; for RE-NBI, jyopt = -0.75; for Hybrid-PH, Aopt = 0.20; for PD, Soft = -0.85; for Heter-PD, 
(6',e)opt = (—0.2, —0.8)). Each number is obtained by averaging over five runs with independently random division of training 
set and probe set. The entries corresponding to the best performance over all methods (except Heter-PD) are emphasized in 
black. 



Algorithms Ranking Score Precision Intra-Similarity Hamming Distance Popularity 



NBI 


0.050 


0.050 


0.366 


0.424 


2366 


Heter-NBI 


0.047 


0.051 


0.341 


0.545 


2197 


RE-NBI 


0.039 


0.062 


0.336 


0.629 


2063 


Hybrid-PH 


0.045 


0.057 


0.311 


0.625 


1998 


PD 


0.041 


0.057 


0.295 


0.639 


1900 


Heter-PD 


0.040 


0.057 


0.266 


0.708 


1742 



e — —0.8, which is the lowest among all the methods re- 
ferred in this paper. Although with a heterogenous initial 
resource distribution, both accuracy and diversity can be 
improved, comparing with pure PD algorithm, such im- 
provements are less remarkable. This indicates that PD 
actually plays the main role of improvements. 

For PD algorithm, the dependence of parameter e on 
accuracy measured by ranking score is shown in figure [T] 
The optimal values of parameter e corresponding to the 
lowest ranking score on two datasets are both equal to 
0.85. Comparing with the standard case NBI, namely 
e = 0, the ranking score can be reduced by 23% for 
MovieLens and 18% for Netflix. We further investigate 
the dependence of ranking score {RS) on the object de- 
gree of four methods, namely NBI, Heter-NBI, Hybrid- 
PH and our method PD. The results are shown in fig- 
ure [21 Notice that, for a given x, its corresponding RS 
is obtained by averaging over all the objects whose de- 
grees are in the range of ( ^ ~^ log 5, log 5] . Insets 
show the RS against logarithm of x. It can be seen that 
the ranking score decreases with the increasing of the ob- 



ject degree for all these four algorithms. This indicates 
that in average popular objects can be more accurately 
recommended than the unpopular objects. The signif- 
icant differences of these four algorithms are embodied 
on their ability of accurately recommending unpopular 
objects. Clearly, PD works best for this task, and is fol- 
lowed by Hybrid-PH. Moreover, comparing the results of 
Heter-NBI with PD, we can see that although they both 
consider the preferential diffusion from user to object, 
considering at the first step (i.e., Heter-NBI) has much 
less effect on the unpopular objects than directly acting 
on the final step (i.e., PD). 

Figure [3] shows how the precision changes with the pa- 
rameter £ for four typical lengths of recommendation list. 
Given L there exists an optimal parameter e leading to 
the highest precision. Although this optimal parameter 
El is different from that subject to the lowest ranking 
score £2, the precision obtained with £2 is also consid- 
erably higher than that obtained by NBI. For example, 
when L = 50 with the optimal parameter corresponding 
to the lowest ranking score, the precision is prominently 
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FIG. 1: The ranking score RS vs. e. Each data point is 
obtained by averaging over five runs, each of which has an 
independently random division of training set and probe set. 
The optimal parameters e for MovieLens and Netflix, corre- 
sponding to the minimal RS, both equal to -0.85. 



improved by 18% and 14% for MovieLens and Netflix 
respectively. 

Hamming distance actually measures the ability that 
an algorithm give personalized recommendation. How 
the parameter e affects the Hamming distance is shown 
in figure U) Clearly, a smaller e leads to a higher Ham- 
ming distance (i.e., higher inter-diversity), and thus a 
more personalized recommendation. Comparing with the 
standard case NBI, given L = 50, Hamming distance can 
be enhanced by 37% for MovieLens and 56% for Netflix 
with optimal parameters corresponding to their respec- 
tive lowest ranking scores, even higher than the Hybrid- 
PH algorithm. As a result, our method has higher ability 
to find the niche (unpopular) objects that may be liked 
by users, and thus give a more personalized recommen- 
dation to the target user. To give more evidences, for a 
given algorithm we collect the top-L recommended ob- 
jects for each user. Denote by d the number of distinct 



FIG. 2: (Color online) The dependence of ranking score {RS) 
on the object degree. For a given x, its corresponding RS 
is obtained by averaging over all the objects whose degrees 
are in the range of {a{x^ — x),a{x^ + x)], where a is chosen 
as i log 5 for a better illustration. Insets show RS against 
logarithm of x. 



objects among all the recommended objects. Then we 
rank the d objects according to their recommended times, 
denoting hy Qi {i = 1, d), in decreasing order. The 
relationships between the objects' recommended times Q 
and their ranks are shown in figure [S] We have tested for 
many different L, and here take L = 50 and L — 100 as 
typical examples. Two important phenomena can be ob- 
tained from figure[S] Firstly, comparing three algorithms, 
NBI, Hybrid-PH with A = 0.2 and PD with e = -0.85, 
we have dpo > dnybrid > dNBi- That is to say PD pro- 
vides larger number of distinct objects to users than NBI 
and Hybrid-PH. For example, when the length of recom- 
mendation list is 50, in MovieLens data, NBI can only 
recommend 293 distinct objects, Hybrid-PH can recom- 
mend 787 distinct objects, while PD increases this num- 
ber to more than 1000. In Netflix data, for the case 
L = 50, more than 5000 distinct objects can be recom- 
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FIG. 3: (Color online) The dependence of precision on pa- 
rameter £. Each data point is obtained by averaging over 
five independent runs with data division identical to the case 
shown in figure [T] The vertical dotted line indicates the opti- 
mal parameter e subject to the lowest ranking score. 



mended through PD algorithm, namely almost every ob- 
ject has the chance to be recommended. Secondly, the 
curves for NBI are remarkably steeper than these from 
Hybrid-PH and PD. Take the MovieLens data for exam- 
ple (the case L = 50), with NBI algorithm, six movies 
are recommended over six hundred times. Since there 
are only 943 users in this dataset, it means that each of 
these movies is recommended to more than two-thirds of 
the users. The result with Hybrid-PH is much better, the 
No.l object is recommended 341 times. However, com- 
paring with Hybrid-PH, see the insets of figure [5l PD 
performs better, which indicates that with PD algorithm 
users are more likely to be recommended with different 
objects, namely PD can provide more personalized rec- 
ommendations. 

Another metric to measure the algorithm's diversity is 
intra-similarity. Different from Hamming distance, intra- 
similarity measures the ability that an algorithm provides 
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FIG. 4: (Color online) The Hamming distance vs. e. Each 
data point is obtained by averaging over five independent runs 
with data division identical to the case shown in figure [T] The 
vertical dotted line indicates the optimal parameter e subject 
to the lowest ranking score. 



diverse recommendations for a single user. The depen- 
dence of intra-similarity on parameter e is shown in fig- 
ure [6l It shows that the parameter e is positively corre- 
lated with intra-similarity, namely the smaller s the lower 
intra-similarity (i.e., higher intra-diversity) . Comparing 
with NBI, when L = 50, intra-similarity can be decreased 
by 21% for MovieLens and 23% for Netflix with opti- 
mal parameters corresponding to their respective lowest 
ranking scores. Even comparing with the Hybrid-PH al- 
gorithm, the improvement can reach up to 5% for both 
datasets. This claims that our method is effective to gen- 
erate more fruitful recommendations. Furthermore, we 
investigate how the two parameters (e, L) affect intra- 
similarity. The intra-similarity / in (e, L) plane for two 
datasets are shown in figure [T] The dashed line indicates 
the intra-similarity of the system which is obtained by 
averaging over all the object pairs. Thus the intra- 
similarity as obtained from (e, L) on the dashed line is 
equal to that of L randomly chosen objects from the sys- 
tem. The left region has lower intra-similarity while the 
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FIG. 5: (Color online) The relationship between the recommended times Q of objects and their ranks for two datasets. Insets of 
two MovieLens sub-figures show the results of top-200 frequently recommended objects. Insets of two Netflix sub-figures show 
Q against logarithm of x (i.e., Rank). For NBI, only the objects inside the blue region have the chance to be recommended. 



right region has higher intra-similarity. As a metaphor, 
one can think the dashed line as a plane lens keeping the 
same size of the user's vision. And in the left region es- 
pecially the area corresponding to smaller e and larger 
L, the algorithm is like a concave lens that broadens the 
user's vision, while in the right region corresponding to 
larger e and smaller L, the algorithm is like a convex 
lens that narrows user's vision. The focal length is deter- 
mined by parameter e. A smaller e in the left region indi- 
cates a smaller focal length for concave lens, and hence a 
broader view, while in the right region indicates a larger 
focal length for convex lens, hence a narrow view. 



In figure [H we report the dependence of popularity on 
parameter e. Similar with intra-similarity, a smaller e 
yields a smaller popularity P, and thus a more novel rec- 
ommendation. Comparing with the NBI, popularity can 
be remarkably improved by 33% and 23% for MovieLens 
and Netflix datasets. Even comparing with the Hybrid- 
PH algorithm, the improvement can reach 7% and 5% 
respectively. 



V. EFFECTS OF DATA SPARSITY 

In this section, we investigate the effects of data spar- 
sity on the algorithmic performance. Since Hybrid-PH is 
the most similar algorithm with our method, we choose 
it for comparison (although RE-NBI is more accurate, 
it considers the high-order correlations between objects). 
We investigate the effects of data sparsity on the algorith- 
mic performance in two ways: (i) For the whole dataset, 
we select p% (ranging from 10% to 90% with step 10%) 
links as training set, and the rest (100— p)% links consti- 
tute the probe set. Clearly, lower p indicates sparser data 
(i.e., less information), (ii) Given a 90%-10% division of 
training set and probe set, we randomly choose p% of 
the known links in the prepared training set to predict 
the links in probe set. To do this, the probe links keep 
unchanged. For example, p =10 means that we actually 
use 9% of the whole dataset to predict the links in probe 
set which contains 10% links of the whole dataset. Lower 
p indicates sparser data. The numerical results on two 
datasets are shown in figurelDfor method (i) and figurefTUl 
for method (ii) . Each point is obtained with the optimal 
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FIG. 6: (Color online) The intra-similarity as a function of 
e. Each data point is obtained by averaging over five inde- 
pendent runs with data division identical to the case shown 
in figure [T] The vertical dotted line indicates the optimal 
parameter e subject to the lowest ranking score. 



parameter subject to the lowest ranking score. From fig- 
ure m it can be seen that the ranking score decreases 
with the increasing of the size of the training set, which 
agrees with the intuition that we can obtain better rec- 
ommendation with more information. Furthermore the 
optimal parameters of both methods decrease with the 
increasing of p for both methods. It shows that when 
training set contains 10% links, the optimal parameters 
are A = 1 for Hybrid-PH and e = for PD, which are 
all corresponding to the standard case NBI. Insets show 
the RS-improvement of PD comparing with Hybrid-PH, 
which is defined as 



Improvement 



RS 



PD 



RS 



(12) 



Hybrid 



where RS* indicates the lowest ranking score for a given 
training and probe set division. Generally speaking, the 
i?S'-improvement increases with the increasing of the size 
of training set. That is to say, PD performs much better 
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FIG. 7: (Color online) The intra-similarity / in (e, L) plane 
for two datastes. The numerical simulation run over the pa- 
rameter L in the interval [10,100] with step length equal to 
10, and the parameter e in the interval [-1,0] and [-1.2,0] for 
MovieLens and Netfltx respectively, with step 0.05. All the 
results are obtained by averaging over five independent runs 
with data division identical to the case shown in figure [T] 
The dashed line indicates that with the parameter combina- 
tion (e, L) on this line the intra-similarity equals the value of 
the system. 



than Hybrid-PH for denser datasets. The qualitative be- 
haviors in figure [TU] are the same as what we obtained 
in figure [9l which further demonstrates that PD can 
give much better predictions than Hybrid-PH for denser 
datasets. 



VI. CONCLUSION AND DISCUSSION 

The preferential diffusion proposed in this paper is a 
kind of biased random walk taking into account the het- 
erogeneity of users' degrees. The present process indeed 
defines a new local index of similarity in bipartite net- 
works (like the original NBI algorithm is correspondirig to 
the so-called resource-allocation similarity index p6l[27|) 
and thus it has potential applications in similarity-based 
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FIG. 8: (Color online) The dependence of popularity (i.e., av- 
erage degree) on e. Each data point is obtained by averaging 
over five independent runs with data division identical to the 
case shown in figure [T] The vertical dotted line indicates the 
optimal parameter e subject to the lowest ranking score. 



link prediction [281 129| , community detection [305 1 node 
classification [3l|, and so on. The biased random vifalk 
itself has already found extensive application in many 
branches of science and engineering, including detecting 
the navigation rules on complex network (3^ , quantifying 
the centrality of vertex and edge [33*1 , modeling the ani- 
mal movements 13411 and information discovery in wireless 
sensor networks |35| . Here we applied the biased random 
walk in dealing with the information filtering process, 
which may also broaden the understanding of the appli- 
cability of biased random walk 

Accuracy metrics have been widely used to evaluate 
the performance of recommendation algorithms and con- 
sidered to be the most important factor. For example, 
the Netflix Prize [s^ focuses only on accuracy. However, 
user satisfaction is not always correlated with high rec- 
ommendation accuracy j25l ISTj . The recommendations 
on popular objects (those are more easily to be found 



FIG. 9: (Color online) The ranking score changes with the 
size of training set measured by the percentage of the whole 
data set. That is to say, we change the size of training set from 
10% to 90% to respectively predict the rest 90% to 10%. Each 
data point is obtained with the parameter (e £ [—1,0] for PD 
and A G [0, 1] for Hybrid-PH with step 0.05) subject to the 
lowest RS. The optimal parameters are labelled in black for 
PD and red for Hybrid-PH. Insets show the i?S'-improvement 
of PD comparing with Hybrid-PH against the size of training 
set. 



in other channels) are less likely to excite users. On 
the contrary, the unexpected and fortuitous recommen- 
dations which are usually related with cold objects are 
more favorable. Such serendipity recommendation will 
improve user experience and thus enhance their loyalty 
to the system. In order to provide accurate as well as 
diverse and novel recommendations, in this paper, moti- 
vated by the perspective of physics, we proposed an algo- 
rithm, named PD, based on preferential diffusion process 
on bipartite networks. We tested our algorithm on two 
benchmark datasets, MovieLens and Netflix, and applied 
five metrics, from the aspects of accuracy, diversity and 
novelty, to evaluate the algorithmic performance. Com- 
paring with the standard algorithm NBI, the accuracy 
measured by ranking score can be further improved by 
23% for MovieLens and 18% for Netflix. Even compar- 
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FIG. 10: (Color online) The ranking score changes with the 
size of training set measured by the percentage of the 90% 
training set. That is to say, given a 90%-10% division of 
training set and probe set, we randomly choose p% of the 
known links in training set to predict the links in the un- 
changed probe set. Each data point is obtained with the pa- 
rameter (e G [-1, 0] for PD and A € [0, 1] for Hybrid-PH with 
step 0.05) subject to the lowest RS. The optimal parameters 
are labelled in black for PD and red for Hybrid-PH. Insets 
show the _RS'-improvement of PD comparing with Hybrid-PH 
against the size of training set. 



ing with the state-of-the-art algorithm, Hybrid-PH, the 
improvement can reach 4% for MovieLens and 9% for 
Netflix. Moreover, the performance of PD can be further 
improved by considering a heterogenous initial resource 
configuration. 

Furthermore, statistical result on the ranking score 
of individual objects shows that our method has much 
higher ability to accurately recommend the low-degree 
objects. That is to say, such prominent improvement 
on accuracy comes mainly from the highly accurate rec- 
ommendation on unpopular objects, and thus it indeed 
enhances the recommendation diversity and novelty. For 
example, if we recommend 50 objects to each user, in 
MovieLens, NBI can only recommend 293 distinct objects 
to all users, Hybrid-PH can recommend 787 distinct ob- 



jects, while PD increases this number to more than 1000. 
In Netflix data, more than 5000 distinct objects can be 
recommended through PD algorithm, namely almost ev- 
ery object has the chance to be recommended. Specially, 
we found that the recommender system may play differ- 
ent roles from the aspect of intra-similarity — the sim- 
ilarity within a user's recommendation list, which is de- 
termined by the algorithm's parameter e and the length 
of recommendation list L. Given (e, L), if the intra- 
similarity generated by algorithm is higher than that of 
L randomly selected objects (i.e., average intra-similarity 
of the whole system), the recommender system plays the 
role as a convex that narrows users' vision, whereas if 
intra-similarity generated by algorithm is lower than that 
of the system, the recommender system plays the role as 
a concave that broadens users' vision. Besides, we inves- 
tigated the dependence of algorithm performance on data 
density. The results show that comparing with Hybrid- 
PH, PD algorithm gives more significant improvement 
for denser data. 

A good recommendation algorithm can guide the sys- 
tem for a better development. You can think that the 
system itself and the recommendation algorithm consti- 
tute a symbiotic system. Generally speaking, there is no 
best recommendation algorithm, but the most suitable 
algorithm for a given system or a user. Just like the mar- 
riage game [38)]: choose the right but not the best. In this 
sense, the most equitable evaluation on recommendation 
algorithm should be based on the user experience which 
is difficult to capture in metric. Notice that, the optimal 
algorithm (or parameter) for the whole system is usually 
different from the optimal algorithm (or parameter) for 
an individual user. Thus an applicable and feasible way 
is building an open recommender system where users can 
help themselves to find their best experienced algorithm 
(or parameter). For example, we can set a bar control- 
ling the parameter of the algorithm on the website. Take 
the PD algorithm as an example, the user may set large 
value of e to obtain recommendations of popular and hot 
items, and set small value of e to obtain recommenda- 
tions of niche and novel items. Here we argue that the 
design of user-centric recommender systems will become 
one of the challenges of the next generation information 
filtering techniques. Finally, we believe that this paper 
may shed some light on this interesting and exciting di- 
rection. 
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